Discovering Planar Disorder in Close-Packed Structures from X-Ray Diffraction: 

Beyond the Fault Model 
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We solve a longstanding problem — determining structural information for disordered materials 
from their diffraction spectra — for the case of planar disorder in close-packed structures (CPSs). 
Our solution offers the most complete possible statistical description of the disorder, and, from it, 
we find the minimum effective memory length of disordered stacking sequences. We also compare 
our model of disorder with the so-called fault model (FM) and demonstrate that in simple cases 
our approach reduces to the FM, but in cases that are more complex it provides a general and 
more accurate structural description than the FM. We demonstrate our technique on two previously 
published zinc sulphide diffraction spectra. 



PACS numbers: 61.72.Dd, 61.10.Nz, 61.43.-j, 81.30.Hd 

Describing the structure of solids — by which we sim- 
ply mean the placement of atoms in (say) a crystal — is 
essential to a detailed understanding of material prop- 
erties. Crystallography has long used the sharp Bragg 
peaks in X-ray diffraction spectra to infer crystal struc- 
ture. For those cases where there is diffuse scattering, 
however, finding — let alone describing — the structure of 
a solid has been more difficult |jl[. Indeed, it is known 
that without the assumption of crystallinity, the infer- 
ence problem has no unique solution |2|. Moreover, dif- 
fuse scattering implies that a solid's structure deviates 
from strict crystallinity. Such deviations can come in 
many forms — Schottky defects, substitution impurities, 
line dislocations, and planar disorder, to name a few. Of 
these, here we consider only planar disorder; that caused 
by one plane of atoms slipping relative to another in a 
layered material. This kind of disorder is known to be 
prevalent in a broad class of materials called polytypes. 

First discovered in SiC by Baumhauer ^ in 1912, poly- 
types H, H are solids built up from identical layers, called 
modular layers (MLs) that differ only in their stack- 
ing orientation. A polytype is simply described by its 
stacking sequence — the one-dimensional list of successive 
orientations found as one moves along the stacking direc- 
tion. We refer to the effective stochastic process induced 
by scanning the list as the stacking process. In the Hagg 
notation for stacking sequences one replaces the set 
{A, B, C} of allowed orientations with a binary alphabet 
{0, 1}: an ML is labeled '1', if it is cyclically related 
to the preceding ML, or 0, if it is not. 

Polytypism is found in dozens of materials; one of the 
best studied is ZnS. There are approximately 185 identi- 
fied crystalline structures Q and many samples exhibit 
varying degrees of disorder. Notably, some ZnS crystals 
have unit cells extending over 100 MLs These differ- 
ent stacking sequences can occur under virtually identical 
thermodynamic growth conditions. The mystery of poly- 
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typism then is two-fold: How can so many different struc- 
tures (crystalline and noncrystalline) exist? And, what 
are the source and range of interlayer interactions that 
produce these structures? Over the last fifty years, con- 
siderable effort has been expended to understand poly- 
typism, with over a dozen theories having been proposed; 
but a general explanation is still lacking ^ . 

Attempts to describe planar disorder in CPSs have a 
long history. Early studies @, ||] focused on stacking 
errors or faults that permeated a parent crystal. Dif- 
ferent kinds of stacking faults were postulated, such as 
growth faults, deformation faults, and layer displacement 
faults 1^. In this fault model (FM) theory, stacking faults 
were introduced randomly into the parent crystal and 
their effect on the intensity, placement, and broaden- 
ing of Bragg peaks was calculated as a function of the 
fault frequency. These efforts met with good success 
for several weakly faulted specimens as such cobalt ||] 
and lithium However, for polytypes such as ZnS 
and SiC, the random insertion of faults often did not 
describe the observed Bragg peaks well. More sophisti- 
cated models |, |l^, were introduced which attempted 
to account for nonrandom fault insertion by assuming the 
existence of some "coordination" between faults. These 
more complicated models gave mixed results. 

We find several drawbacks to the FM. The first is the 
need to assume a single parent crystalline structure into 
which stacking defects are introduced; this precludes the 
description of disorder interspersed between distinct crys- 
tal structures. In some polytypes, such as ZnS, for ex- 
ample, there is considerable interest in characterizing the 
transformation between the hexagonally closed-packed 
(HCP) structure and the cubic close-packed (CCP) struc- 
ture when the crystal is subjected to an external stress, 
such as annealing j5|. In these cases, there is no single 
parent crystal in which to introduce faulting. A second 
drawback is not inherit to the FM, but in the way it is 
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analyzed |5j. By considering only the effects of faulting 
on Bragg peaks, information in the diffuse scattering is 
ignored. (Our second example below demonstrates how 
misleading this can be.) Our final difficulty is with the 
FM generally. Here we show that it is not possible to 
uniquely identify and assign faulting sequences to disor- 
dered crystals, except in those special cases to which the 
FM is restricted. 

In this Letter, we introduce a novel method for dis- 
covering and describing disordered stacking sequences 
in CPSs that overcomes these difficulties. We analyze 
two previously published diffraction spectra for polytypic 
ZnS H] and compare our results with those of the FM. 

Our method can be broken into three parts. In the 
first, we use a diffraction spectrum to find average cor- 
relations between MLs as a function of the number n of 
separating layers. If we assume that the MLs themselves 
are undefected, that each ML has the same scattering 
power, and that the spacing between MLs is indepen- 
dent of the local stacking arrangement, then correlation 
factors (CFs), Qc{n) and Qa{n) jl^, can be found by 
Fourier analysis of the diffraction spectrum Qc(n) 
and Qa{n) are defined as the probability that any two 
MLs at separation n are cyclically or anticyclically re- 
lated, respectively. 

In the second part of our approach, we infer the spa- 
tial patterns of MLs that reproduce these CFs by recon- 
structing an e-machine , which describes the minimal 
effective states of the stacking process. Assume we know 
the probability of stacking sequences cu. At each ML 
in a stacking sequence define the "past" oj as those MLs 
already seen and the "future" w as those yet to be seen: 
u! =ujoj. The effective states of the stacking process then 
are defined as the sets of pasts uj that lead to statistically 
equivalent futures: 

LUir-^ujj if and only if p{lu \ lo{) — p{oJ \ oJ j) . (1) 

These equivalence classes of pasts are the stacking pro- 
cess's causal states. Along with their transitions, they 
comprise the process's e-machine — a statistical descrip- 
tion of the ensemble of spatial patterns that produces the 
stacking distribution p{lo). It has been shown that the 
e-machine is the optimal predictor of minimal size (as 
measured by the number of states) of a process, and, up 
to state- relabeling, it is the unique such description [ p^ . 

To find the causal states we must first estimate the 
probability p{lj) of stacking sequences uj averaged over 
the sample. Note that, from conservation of probability, 
p{u) = p{Ou) + p{lu) = p{uO) +p{ul), for all u £ , 
where A'' is the set of all sequences of length r. Addi- 
tionally, the probabilities for sequences of the same length 
are normalized: J2ljgA^+^ Pi'^) = 1- Together these con- 
straints provide 2^ independent relations among proba- 
bilities for the 2''+^ possible stacking sequences of length 
r + 1. 



The other 2'" constraints come from relating CFs to 
sequence probabilities via 

g„(n) = J2 P(^) ' (2) 

where is that subset of length-n sequences with a 
cyclic (a — c) or an anticyclic {a ~ a) rotation between 
MLs at separation n. We take as many of these latter re- 
lations as necessary to form a complete set of equations. 
At a fixed r, the set of equations describes the stacking 
sequence as generated by an r*^-order Markov process. 
At r = 3 one encounters the first nonlinearities due to 
the necessity of using CFs at n = 5 to obtain a complete 
set of equations. We rewrite the probability of sequences 
of length n = 5 in terms of the conditional probabilities 
of those at n = 4, and it is this mapping that is non- 
linear. We solve numerically for the stacking sequence 
probabilities p{u!) and then find the set of causal states 
using the equivalence relation Eq. (|l|). The causal-state 
transitions are estimated from the conditional distribu- 
tions of the next ML orientation given pasts associated 
with each causal state. 

In the third and final part, we begin with the r = 1 re- 
constructed e-machine, use it to generate a sample stack- 
ing sequence (here we used length 400, 000), and from this 
we estimate the e-machine's predicted CFs and diffrac- 
tion spectrum. We then compare the latter to the ex- 
perimental diffraction spectrum. If there is not sufficient 
agreement, we increment r and repeat the reconstruc- 
tion and comparison. The resulting r is called the stack- 
ing process's memory length, since it is the amount of 
history (in MLs) one must use to optimally predict the 
process. 

ZnS can be thought to have a CPS with a basis com- 
posed of two atoms, zinc and sulphur, with the sulphurs 
displaced one quarter of a body diagonal (as referred 
to the conventional unit cell) along the stacking direc- 
tion We take an ML to be this zinc sulphur pair ar- 
ranged in a hexagonal net giving (as with any CPS) 
three absolute orientations for the MLs, but only two 
relative orientations for neighboring layers. We correct 
the experimentally obtained diffraction spectrum for the 
atomic scattering factors, the structure factor, dispersion 
factors, and polarization of the incident radiation [p^ . 

We now give the results for e-machine reconstruc- 
tion for two experimental diffraction spectra, SK134 and 
SK135 from Q. Let / be a continuous variable that in- 
dexes the magnitude of the perpendicular component of 
the diffracted wave k = 27r//c, where c is the spacing 
between adjacent MLs. We select a unit interval in / on 
which to analyze each spectra. Since many diffraction 
spectra suffer from experimental error we show else- 
where ||l5| that there are relations that the CFs must 
obey for any CPS and that we can use them to select a 
relatively error-free Z-interval. The spectra from experi- 
ment and e-machine reconstruction are normalized. 
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FIG. 1: Comparison of the experimental diffraction spectrum 
SK134 along the lOJ row (triangles) for a disordered ZnS 
single crystal p. 134] with spectra estimated from the 
FM with 5% deformation faulting (dashed line) and r — 3 
e- machine (solid line). The vertical scale in the inset is loga- 
rithmic intensity. 



The triangles in Fig. |l| show the experimental diffrac- 
tion spectrum SK134 along the 10. ^ row for an HCP ZnS 
crystal annealed at 300 C for one hour. Sebastian and 
Krishna ^ attribute the observed disorder to the intro- 
duction of 5% deformation faulting. (This is the FM 
predicted spectrum given as a dashed line in Fig. |^). We 
find that the smallest-r e-machine that gives adequate 
agreement (solid line) with experiment is estimated at 
r = 3; it is shown in Fig. ||. 

It is possible to give an approximate equivalent of this 
e-machine in terms of the FM, but we stress that this 
decomposition is not unique. We associate each closed, 
nonintersecting loop (called a simple cycle or SC p^) in 
the e-machine with either a crystal structure or a fault. 
In this way, e-machines directly describe familiar struc- 
tures in polytypes. For instance, the closed loop between 
causal states C and H in Fig. || implies a stacking se- 
quence . . . 010101 . . . , which is simply the Hagg nota- 
tion for the HCP structure. One concludes, then, that 
there is no qualitative difference between what one calls 
faults and crystal structure. The distinction is, in fact, 
quantitative and one of convenience — crystal structures 
have relatively high probabilities, as opposed to the rarer 
faults. For the most general r = 3 e-machine, it is known 
that there are 19 such SCs (l|. Since eight indepen- 
dent CFs are sufficient to specify an r = 3 e-machine, 
the problem of decomposing the e-machine into SCs is 
underdetermined. This conclusion holds for all r > 2. 
Therefore, without a fortuitous vanishing of causal states 
or transitions, the fault description is not unique. 

For the sake of comparison with previous FM analyses, 
we decompose the e-machine in Fig. |^ into SCs with the 
assumption that faults corresponding to SCs of length 7 
or greater are not present. We can then assign an ap- 
proximate fault distribution for SK134 (second column) 




FIG. 2: The recurrent causal states {A — H} of the recon- 
structed e-machine estimated from the experimental diffrac- 
tion spectrum SK134 of Fig. |l] with r — 3. Asymptotic state 
probabilities are given in parentheses; edge label s\p indicates 
a transition on symbol s with probability p. 

as follows and compare it to that of |5| (third column): 

HCP 64% 95% 

CCP 8% 0% 

Deformation Fault 16% 5% 

Growth Fault 6% 0% 

Layer Displacement Fault 6% 0% 

The e-machine description of the crystal differs signifi- 
cantly from that of Sebastian and Krishna [||. While 
we both find qualitatively that deformation faulting is 
important, we also detect CCP structures, as well as 
growth faults and layer displacement faults. Overall, e- 
machine analysis finds a much more disordered crystal. 
This is borne out when comparing the FM and e-machine 
diffraction spectra. Fig. |l| shows that, while both agree 
reasonably well with experiment at the broadened peaks 
at I = 0.5 and 1, the e-machine is in better agreement 
along the shoulders of the Bragg peaks, as well as at the 
rise in broadband intensity at I w 0.67 (inset in Fig. |l|). 

Fig. 1^ plots the experimental diffraction spectrum 
along the 10. ^ row (triangles) for a HCP ZnS crystal an- 
nealed at 500 C for one hour. Sebastian and Krishna ||] 
calculate a twin-fault probability of 12% from the ob- 
served half- widths of the peaks. The calculated diffrac- 
tion spectrum for such a faulting mechanism is shown in 
Fig. H (dashed line). Only the peak at I = —0.33 was 
used to find the faulting mechanism, and one sees that 
the FM reproduces it well. However, the second peak at 
I = —0.67 is poorly represented, as is the diffuse scat- 
tering between the two peaks. This demonstrates the 
pitfalls in simply fitting an FM to a single Bragg peak, 
ignoring the information contained in other peaks and in 
the diffuse scattering. We also note that the small rise 
in diffracted intensity at / « —0.16 is likewise missed by 
the FM. The e-machine spectrum (solid line) also misses 
this rise, but otherwise is in excellent agreement with the 
experiment. Fig. H shows the reconstructed e-machine 
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FIG. 3: Comparison of the experimental diffraction spectrum 
SK135 along the lO.l row (triangles) for a disordered CCP ZnS 
single crystal js], p. 135] with the diffraction spectra calculated 
from the the FM with 12% twined faulting (dashed line) and 
r — 3 e- machine (solid line). 



obtained at ?■ = 3. The large probabilities for causal 
states A and F and their large self-loop transition prob- 
abilities, associated with stacking sequences ... 1111 .. . 
and . . . 0000 . . . , indicate that this is a twined-CCP crys- 
tal. The missing H ^ C causal-state transition, and so 
the resulting absence of the . . . 0101 . . . stacking, implies 
that the original HCP structure has been eliminated. 

In conclusion, we presented a novel method for discov- 
ering and describing planar disorder and organization in 
CPSs. We demonstrated that the FM, both in concep- 
tion and practice, is unable to accommodate the variety 
of stacking arrangements possible in nature. In contrast, 
e-machines provide a unique description of structure and 
can be used to quantify any amount of disorder. We 
argued that simply examining the effects of disordered 
stacking on the Bragg peaks is insufficient to properly 
detect the disorder present. Moreover, we quantified the 
memory length for disordered ID systems; for the ZnS 
samples considered it was 3 MLs. Thus, the origins of 
polytypic ordering lie outside the physical ID Hamilto- 




nian, from which one calculates a physical interaction 
range of 1 ML ||. 

While we performed e-machine reconstruction only to 
r = 3, the extension to higher r is straightforward, but 
quite demanding computationally. Additionally, know- 
ing the e-machine for a material's spatial patterns al- 
lows calculation of physically relevant quantities. We 
show elsewhere that given the coupling constants be- 
tween MLs [|l^, we can determine the average stacking- 



fault energy for a disordered crystal |15 . We expect that 
other physical parameters will be amenable to calculation 
directly from e-machines. 
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FIG. 4: Recurrent states of the reconstructed e-machine for 
the experimental diffraction data SK135 of Fig. fusing r = 3. 
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